In particular, when recognizing names, we use the Viterbi algorithm to determine the most probable sequence of context states in a sentence, and combine it with local statistics of the text to match and recognize personal names, place names, and transliterated names.
Text categorization involves the combined application of several methods, including text model representation, feature selection, classification algorithms, and term weighting; a suitable classification scheme has to be chosen according to the characteristics of each text collection.
We propose encoding run-length values adaptively, in their binary form, using arithmetic coding with three models; the approach can be applied to any existing encoder or decoder that uses run-length coding.
This paper proposes expressing the solution steps of theoretical course knowledge as nested SUM, TERM, and FIELD operators, and simplifying the more complicated FIELD expressions with the variable O, which ensures the efficiency and accuracy of recognition.